The Use of a Commercial Natural Language Interface in the ATIS Task
نویسنده
چکیده
A natural language interface for relational databases has been utilized at AT&T Bell Laboratories as the natural language component of the DARPA ATIS common task. This is a part of a larger project that consists of incorporating a natural language component into the Bell Laboratories Speech Recognizer. The commercially available system used in this project was developed by Natural Language Incorporation (NLI), in particular by J. Ginsparg [Ginsparg, 1976]. We relate our experience in adapting the NLI interface to handle domain dependent ATIS queries. The results of this allowed the exploration of several important issues in speech and natural language: 1. the feasabilitiy of using an off-the-shelf commercial product for a language understanding front end to a speech recognizer, 2. the constraints of using a general-purpose product for a specific task. 1 I n t r o d u c t i o n The ATIS common task was designed by DARPA and the members of the DARPA community to build and evaluate a system capable of handling continuous and spontaneous speech recognition as well as natural language understanding. Although the evaluation task is still not fully defined, the ATIS common task presents the opportunity to develop reliable and measurable criteria. The present paper focuses on the natural language component only, the integration with speech being reported in other papers [Pieraccini and Levin, 1991]. The domain of the task is on the Air Travel Information Service (ATIS). The project touches on a wide range of issues both in natural language and speech recognition, including incorporation of an NL interface in speech understanding, flexibility in the type of input language (i.e. spoken or written), relational databases, evaluation of system performance, possible limitations, and others. The NLI system 1 is driven by a syntactic parser designed to handle English queries that are characteristic of the written language. In contrast, ATIS syntax is characteristic of spoken and spontaneous language. Therefore, one of the primary questions in using the NLI system has been how to overcome problems related to the discrepancy between written and spoken language input. Issues related to the ATIS domain and queries on the one hand, and to the construction of the NLI interface on the other hand are addressed. The task of the experiment is then described along with the results. 2 W h y use a c o m m e r c i a l product? Using a commercial product is attractive for a number of reasons: within Bell Laboratories, there has been no effort so far to develop a natural language interface (although this may change). Therefore, it is a significant savings of time and effort to use a publicly available system in order to achieve the larger task, that is the integration of speech and natural language. within the task of language understanding, the use of a natural language interface meant to understand written language input, exposes issues specific to speech incorporation. 3 NLI s y s t e m d e s c r i p t i o n The NLI system is composed of a series of modules, including a spelling corrector, a parser, a semantic interface consulting a knowledge representation base, a con1The acronym NLI should not be confused with the suffix of the transcription sentences ".rdi", meaning natural language input.
منابع مشابه
Iranian Advanced EFL Learners’ Awareness and the Use of Marked Word Order: Discourse-pragmatically Motivated Variations
The present investigation was designed to study the production and comprehension of specific means for information highlighted by advanced Iranian learners of English as a Foreign Language. The study focused on the discourse-pragmatically motivated variations of the basic word order such as inversion, pre-posing, it- and Wh-clefts. After taking the Nelson test, a homogeneous group was settled. ...
متن کاملManagement and Evaluation of Interactive Dialog in the Air Travel Domain
Introduction This paper presents the Unisys Spoken Language System, as applied to the Air Travel Planning (ATIS) domain. 1 This domain provides a rich source of interactive dialog, and has been chosen as a common application task for the development and evaluation of spoken language understanding systems. The Unisys approach to developing a spoken language system combines SUMMIT (the MIT speech...
متن کاملCombining Linguistic and Statistical Technology for Improved Spoken Language Understanding
SRI has developed a spoken language interface to the Official Airline Guide (OAG). Despite a funding gap for more than four months of the year, substantial improvements have been made in the component technologies. On recent ARPA benchmarks. SRI achieved 5.5% word error on the ATIS speech recognition task, 18.2% utterance error on the natural-language understanding task, and 20.7% utterance err...
متن کاملDARPA February 1992 ATIS Benchmark Test Results
This paper documents the third in a series of Benchmark Tests for the DARPA Air Travel Information System (ATIS) common task domain. The first results in this series were reported at the June 1990 Speech and Natural Language Workshop [1], and the second at the February 1991 Speech and Natural Language Workshop [2]. The February 1992 Benchmark Tests include: (1) ATIS domain spontaneous speech re...
متن کاملA stochastic case frame approach for natural language understanding
A stochastically based approach for the semantic analysis component of a natural spoken language system for the ATIS task has been developed. The semantic analyzer of the spoken language system already in use at LIMSI makes use of a rule-based case grammar. In this work, the system of rules for the semantic analysis is replaced with a relatively simple, first order Hidden Markov Model. The perf...
متن کامل